Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval

ثبت نشده
چکیده

Here we offer a more detailed description of the proposed model to facilitate re-implementation. A schematic illustration of the network architecture of the proposed model can be found in Fig. 2 of the main paper. It shows that the model is a Siamese triplet network with three branches of identical architectures and shared parameters. In this section, we further describe the detailed network architecture for each branch. Table 1 shows that each network branch has 7 convolutional layers and 2 fully connected layers. The first 5 convolutional layers are the same as the Sketch-a-Net [5] and the last 2 convolutional layers are part of the proposed attention module. Note that the output of the attention module is a 7× 7 attention mask which is used to re-weight the feature map of pool5. The attention module shortcut connection (see Sec. 3.2 of the main paper) takes place before Layer No. 12 and the coarse-fine fusion shortcut connection occurs before Layer No. 14 (as detailed in Sec. 3.3 of the main paper). The final output of each branch is a 512D feature vector which is then subjected to our HOLEF loss.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval

With touch-screen devices becoming ever more ubiquitous, sketch holds great promise as an intuitive and efficient mode of input compared to classic alternatives. This has motivated a major revival of interest in vision-based analysis of sketches, notably in sketch-based image retrieval (SBIR). Superior to classic SBIR methods, finegrained SBIR (FG-SBIR) methods [1] are proposed to make fine-gra...

متن کامل

Instance-Level Coupled Subspace Learning for Fine-Grained Sketch-Based Image Retrieval

Fine-grained sketch-based image retrieval (FG-SBIR) is a newly emerged topic in computer vision. The problem is challenging because in addition to bridging the sketch-photo domain gap, it also asks for instance-level discrimination within object categories. Most prior approaches focused on feature engineering and fine-grained ranking, yet neglected an important and central problem: how to estab...

متن کامل

Fine-grained sketch-based image retrieval by matching deformable part models

An important characteristic of sketches, compared with text, rests with their ability to intrinsically capture object appearance and structure. Nonetheless, akin to traditional text-based image retrieval, conventional sketch-based image retrieval (SBIR) principally focuses on retrieving images of the same category, neglecting the fine-grained characteristics of sketches. In this paper, we advoc...

متن کامل

Fine-Grained Image Retrieval: the Text/Sketch Input Dilemma

Fine-grained image retrieval (FGIR) enables a user to search for a photo of an object instance based on a mental picture. Depending on how the object is described by the user, two general approaches exist: sketch-based FGIR or text-based FGIR, each of which has its own pros and cons. However, no attempt has been made to systematically investigate how informative each of these two input modaliti...

متن کامل

Query-adaptive Image Retrieval by Deep Weighted Hashing

The hashing methods have attracted much attention for large scale image retrieval. Some deep hashing methods have achieved promising results by taking advantage of the better representation power of deep networks recently. However, existing deep hashing methods treat all hash bits equally. On one hand, a large number of images share the same distance to a query image because of the discrete Ham...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017